Audio Information Extraction from Arbitrary Sound Recordings

Authors

  • Philip J. Duncan
  • Duraid Y. Mohammed
  • Francis F. Li
Abstract

Numerous archives of entertainment soundtracks and other recordings, such as environmental noise samples, pose a big-data challenge for audio-related industries. This necessitates machine audition and retrieval tools that extract semantic information for various applications. Speech recognition, environmental noise classification and music information retrieval tools have been developed in the past for specific purposes. Combining these tools to process arbitrary sound recordings remains challenging: overlap of diverse sources hampers classification, resulting in poor recognition and/or missing content. Following a review of a universal framework for arbitrary soundtrack information mining proposed by the authors, this paper develops a new solution to the problem of overlapping sound sources, based on iterative signal cleaning techniques. The system classifies arbitrary audio signals into music, speech, ambient sounds and silence, allowing overlap. Validation tests have shown that the new techniques can reduce or eliminate information losses in machine audition, thereby improving its usability in processing real-world audio archives. This paper also discusses the dataset and principles, presents the validation results and considers potential applications.
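The idea of labelling a segment with several classes by repeatedly detecting and removing one source can be sketched as follows. This is only an illustrative toy, not the authors' implementation: the per-class energy dictionary, the names `dominant_class`, `remove_class` and `label_with_overlap`, and the `SILENCE_THRESHOLD` value are all assumptions introduced here, and a real system would operate on waveforms with trained classifiers and signal-separation steps.

```python
from typing import Callable, Dict, Set

# Hypothetical representation: a segment is summarised by the energy
# attributed to each source type (a real system would use audio samples).
Segment = Dict[str, float]

SILENCE_THRESHOLD = 0.05  # assumed value, chosen only for illustration


def dominant_class(segment: Segment) -> str:
    """Toy stand-in for a classifier: pick the most energetic source."""
    return max(segment, key=lambda name: segment[name])


def remove_class(segment: Segment, label: str) -> Segment:
    """Toy stand-in for signal cleaning: drop the detected source."""
    return {name: e for name, e in segment.items() if name != label}


def label_with_overlap(
    segment: Segment,
    classify: Callable[[Segment], str] = dominant_class,
    clean: Callable[[Segment, str], Segment] = remove_class,
    max_iters: int = 3,
) -> Set[str]:
    """Collect every class found by classifying, cleaning and repeating."""
    labels: Set[str] = set()
    residual = dict(segment)
    for _ in range(max_iters):
        # Stop once the residual carries too little energy to classify.
        if not residual or sum(residual.values()) < SILENCE_THRESHOLD:
            break
        label = classify(residual)
        labels.add(label)
        residual = clean(residual, label)
    # A segment in which nothing was detected is labelled as silence.
    return labels or {"silence"}


if __name__ == "__main__":
    mixed = {"speech": 0.6, "music": 0.3, "ambient": 0.01}
    print(sorted(label_with_overlap(mixed)))  # ['music', 'speech']
```

Passing the classifier and the cleaning step as callables keeps the loop independent of any particular model, so either stand-in could be swapped for a real component without changing the control flow.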


Similar resources

A Supervised Learning Approach to Ambience Extraction from Mono Recordings for Blind Upmixing

A supervised learning approach to ambience extraction from one-channel audio signals is presented. The extracted ambient signals are applied for the blind upmixing of musical audio recordings to surround sound formats. The input signal is processed by means of short-term spectral attenuation. The spectral weights are computed using a low-level feature extraction process and a neural network regr...

A new approach to detecting auditory onsets within a binaural stream

The human auditory system is particularly sensitive to spatial information conveyed in the first two milliseconds of an auditory event. Therefore, in order to analyse a stream of binaural data in a perceptually relevant way, it is important to determine quickly and precisely the onset of each event within a data stream. This paper details the design of an auditory onset detector which is intend...

Features for Content-Based Audio Retrieval

Today, a large number of audio features exist in audio retrieval for different purposes, such as automatic speech recognition, music information retrieval, audio segmentation, and environmental sound retrieval. The goal of this paper is to review the latest research in the context of audio feature extraction and to give an application-independent overview of the most important existing techniques....

Studies on Bird Vocalization Detection and Classification of Species

Aalto University, P.O. Box 11000, FI-00076 Aalto, www.aalto.fi. Author: Seppo Fagerlund. Name of the doctoral dissertation: Studies on Bird Vocalization Detection and Classification of Species. Publisher: School of Electrical Engineering. Unit: Department of Signal Processing and Acoustics. Series: Aalto University publication series DOCTORAL DISSERTATIONS 166/2014. Manuscript submitted: 12 June 2014. Date o...

Extraction and Removal of Percussive Sounds from Musical Recordings

Automated removal and extraction (isolation) of percussive sounds embedded in an audio signal is useful for a variety of applications, such as speech enhancement and music processing effects. A novel method is presented to accomplish both extraction and removal of beats, using an adaptive filter based on the LMS algorithm. Empirical evaluation is undertaken using computer-generated music wit...


Publication date: 2015